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[57] ABSTRACT 

A two-bit Booth multiplier circuit performs two-bit multi- 
plication iterations using a single adder while retaining the 
same data path width and the multiplicand multiples of a 
single-bit Booth multiplier circuit. The two-bit Booth mul- 
tiplier circuit halves the number of multiplier iterations of a 
single-bit multiplier. A multiplier circuit includes an adder 
having a first input terminal, a second input terminal, and an 
output terminal and a plurality of shift registers. The regis- 
ters include a multiplicand register having an output termi- 
nal connected to the first input terminal of the adder, a 
temporary shift register having an output terminal connected 
to the second input terminal of the adder and having an input 
terminal connected to the output terminal of the adder, and 
a multiplier shift register having an input terminal connected 
to the output terminal of the adder and the multiplier shift 
register and having an output terminal. The multiplier circuit 
further includes a shift logic connected between the adder 
and the plurality of shift registers and a Booth logic circuit 
having an input terminal connected to the output terminal of 
the multiplier shift register and having an output terminal 
connected to the shift logic. The Booth logic circuit controls 
a two-bit Booth multiplier operation designating multipli- 
cand partial product operations including +0 and ±x partial 
product operations and implementing ±2x partial product 
operations using the ±x multiplicand partial product opera- 
tions while shifting data from the temporary shift register 
one bit right on application to the adder. 

20 Claims, 5 Drawing Sheets 
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TWO-BIT BOOTH MULTIPLIER WITH What is needed is a multiplier circuit and operating 

REDUCED DATA PATH WIDTH method that increases operating speed while retaining the 

compact circuit size of a single-multiplier iterative multi- 

BACKGROUND OF THE INVENTION plication circuit. 

1. Field of the Invention ^ SUMMARY OF THE INVENTION 

The present invention relates to computer arithmetic In accordance with the present invention, a two-bit Booth 

circuits and operating methods. More specifically, the inven- multiplier circuit performs two-bit multiplication iterations 

tion relates to a two-bit Booth multiplier circuit. ^^i^S ^ ^^"ele adder while retaining the same data path width 

2. Description of the Related Art lo 

FIG. 1, which is labeled prior art, illustrates a block of a single -bit multiplier, 

diagram of a simple Booth muhipher 100 for signed inte- in accordance with an embodiment of the present 

gers. The numbers x=x„_iX„_2. . . Xq and y=y„_iY„_2' • • yo invention, a multiplier circuit includes an adder having a first 

are respectively placed into register X 102 and register Y input terminal, a second input terminal, and an output 

104 for multiplication by the Booth multiplier 100 and a is terminal and a plurality of shift registers. The registers 

register P 106 is initially set to zero. A multiplication include a multiplicand register having an output terminal 

operation is executed in a sequence of two-part iterations. In connected to the first input terminal of the adder, a tempo- 

a first part, register Y 104 containing bits Y„_jy„_2. . . yo is rary shift register having an output terminal connected to the 

added to register P 106 when the least significant bit of second input terminal of the adder and having an input 

register X 102 is 1. Register P 106 is left unchanged when 20 terminal connected to the output terminal of the adder, and 

the least significant bit of register X 102 is 0. In a second ^ multiplier shift register having an input terminal connected 

part, register P 106 and register X 102 are shifted right with ^^e output terminal of the adder and having an output 

the low-order bit of register P 106 moving to the high-order terminal. ITie multiplier circuit further includes a shift logic 

bit of register X 102 and the low-order bit of register X 102 connected between the adder and the plurality of shift 

discarded. After n iterations, the multiplication product is 25 registers and a Booth logic circuit having an input terminal 

held in register P 106 and register X 102 with the least- connected to the output terminal of the multiplier shift 

significant bits held in register X 102. register and having an output terminal connected to the shift 

n J- . o 1.- 1- • V. • c 1 f logic. The Booth logic circuit controls a two-bit Booth 

Booth recoding in a Booth multiplier circuit is useful for , • i . • j • • i • i * i ■ , , 

1 ™ *• 1.' r *u . . multip her operation designating multiplicand partial product 

implementing a multiplication operation that operates on \ - i ^ j • i j • 

. f. I J • J u 1^ • J 30 operations including +0 and ±x partial product operations 

both on unsigned and signed numbers. For unsigned j . , * , j . • 

multiplication, the unsigned numbers are converted to ""^ 'mplementing ±2x partial product operaUo^ using the 

signed numbers by appending a zero value bit to the left of mull.phcand partial product operations whi e shifting 

, ■ c . r \ c u V data from the temporary shift register one bit ngbt on 

the most significant bit (x„.i) of a number x=X„.i x^.^. . . --.n-ation to the adder 

Xq. Accordingly, a set of rules are defined for implementing appucaiion to me aaaer. 

a Booth multiplier 100 assuming that register X 102 initiaUy ^5 Many advantages are achieved by the described Booth 

holds a number x=x„ ,x„ . . Xo and a zero value bit is multipher circuit and operating method. One advantage is 

appended to the right of the least significant bit (xo) of the "'^^ the described Booth multiplier circuit has a reduced 

number x=x„ , x„ . . Xo for a Booth encoding. The rules "f*^.", ,^'!'' '^^^''^'^ complexity. It is advantageous that 

applied to the Boolh multiplier where x, , is initially 0 and ^P^'^J^l handhng of ±2x partial product operations reduces 

the ith multiply iteration the low-order bit of register X 102 ^^'^ P*'^ °^ ""^ S""* multiplier. Advantageously, the 

is X are as follows" Booth multiplier circuit only utiuzes ±x and +0 partial 

A\ ,r ^ , ^ . , 1 ^ „ -.^^ product operations rather than +0, +2x and ±x partial prod- 

(1) If x,=0 and x,_,=0, then add 0 to register P 106. operations of other two-bit Booth multiplier embodi- 

(2) If x-0 and x,_i«l, then add register Y 104 to register rnents. The reduced number of partial product operations 
^ 45 advantageously allows the selection of the partial product 

(3) If Xi-1 and x^.^^O, then subtract register Y 104 from using a three-to-one multiplexer, rather than a five-to-one 
register P 106. multiplexer. Usage of the three-to-one multiplexer improves 

(4) If x,«l and x^.^^l, then add 0 to register P 106. the operating speed of the Booth multiplier and reduces 
A simple technique for implementing the rules for the loading on the Booth multiplier. The two-bit shift operation 

Booth multiplier 100 involves extending the register X 102 so advantageously halves the number of iterations that are used 

one bit to the right to define a new bit x.^. Two's comple- for a Booth multiply operation using a single adder, 

ment multiplication is implemented by taldng into consid- g^j^p dhSCRIPTION OF THE DRAWINGS 
eration the sign of the value in register P 106 while the 

register P 106 is shifted right and saving the most recently ^he features of the described embodiments believed to be 

shifted bit of register X 102 for usage in deciding whether ss "^^^^ are specifically set forth in the appended claims, 

to add or subtract register Y 104 from register P 106. However, embodiments of the invention relating to both 

Various considerations are taken into account in the structure and method of operation, may best be understood 

design of a multiplier circuit. For example, if operating referring to the following description and accompanying 

speed is paramount, a multiplier circuit typically includes a drawings. 

plurality of adders forming a multiplier array. A full-range 60 I^IG. 1, which is labeled prior art, illustrates a block 

multiplier array generaUy includes an adder for each of the diagram of a simple Booth multiplier for integer numbers, 

bits in a multiplicand. Unfortunately, a multiplier circuit FIG. 2 is a schematic block diagram showing a two-bit 

having a plurality of adders requires a large circuit area. Booth multiplier circuit in accordance with an embodiment 

In an implementation of a multiplier where the design of the present invention, 

takes circuit area considerations into account, a single adder 65 FIG. 3 is a high-level schematic block diagram illustrating 

may be used under control of a state machine or software to a multimedia multiprocessor system in accordance with an 

perform a multiply operation in a sequence of iterations. embodiment of the present invention. 
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FIG. 4 is a schematic block diagram showing a multime- 
dia signal processor included in the multimedia multipro- 
cessor system illustrated in FIG. 3. 

FIG. 5 is a schematic block diagram which illustrates a 
control processor in the multimedia multiprocessor system, 5 

DETAILED DESCRIPTION OF TIIE 
PREFERRED EMBODIMENTS 

Referring to FIG. 2, a schematic block diagram illustrates 
an embodiment of a two-bit Booth multiplier circuit 200 ]o 
which performs two-bit multiplication iterations using a 
single adder 202, while conserving width of a data path 204, 
In a two-bit Booth multiplier two bits are retired during each 
iteration, in comparison to one bit that is retired by a 
conventional Booth multiplier, so that only half the itera- is 
tions of a conventional Booth multiplier are performed. The 
adder 202 generates a 32-bit resuU R<31:0> which forms the 
32-bit data path 204. The two-bit Booth multiplier circuit 
200 performs a multiplication operation of a datum held in 
a multiplicand register x<31 :0> 206 and a datum held in two 20 
shift registers including a temporary register z<3l:0> 210 
and a multiplier shift register y<31:0> 208. Although the 
two-bit Booth multiplier circuit 200 is illustratively shown 
as a 32-bit multiplier circuit with a 32-bit multiplicand 
register x<31:0> 206 and a 32-bit multiplier shift register 25 
y<31:0> 208, other sizes of circuits may also be imple- 
mented including, but not limited to, 8-bit, 16-bit, 64-bit, 
256-bit and other size circuits. Other embodiments may 
implement a multiplier size that is not a multiple of 2. The 
result of the multiplication operation is stored in the multi- 30 
plier shift register y<31:0> 208 and a temporary shift 
register z<31:0> 210, where the temporary shift register 
z<31:0> 210 holds the most significant bits of a result 
R<63:0> and the multiplicand register x<31:0> 206 holds 
the least significant bits of the result R<63:0>, 35 

The adder 202 is a 32-bit adder with a first input temainal 
connected to the multiplicand register x<31:0> 206 through 
a multiplicand multiplexer 212 and a second input terminal 
connected to the temporary shift register z<3l:0> 210 
through a result select multiplexer 214. Ilie adder 202 has a 40 
third input terminal connected to a negative x multiplexer 
220. The negative x multiplexer 220 has first and second 
input terminals for receiving logic 0 and logic 1 values under 
control of a select -x signal from the Booth logic circuit 218. 
The negative x multiplexer 220 is used to add a 1 to one's 45 
complemented data from the multiplicand register x<31:0> 
206 to supply a two's complement number. The adder 202 
has an output terminal that is connected to a result set 
multiplexer 216 supplying result data bits R<31:1> and 
connected to a multiplier multiplexer 222 supplying result 50 
data bits R<1:0>. Temporary data z<31:0> from the result 
set multiplexer 216 are selected by control signals from a 
Booth logic circuit 218 and written to the temporary shift 
register z<31:0> 210. I^ast significant result data bits 
R<1:0> from the adder 202 are also connected to an input 55 
terminal of a multiplier multiplexer 222. The multiplier 
multiplexer 222 also has an input terminal connected to the 
temporary shift register z<31:0> 210 to receive the least 
significant bit z<0>. Control of each multiplication iteration 
is performed by a Booth logic circuit 218. The Booth logic 60 
circuit 218 registers the control signals internally in a 
register 224. The Booth logic circuit 218 is connected to the 
multiplier shift register y<31:0> 208 and to the multiplier 
multiplexer 222 to receive data which directs the state of the 
Booth logic circuit 218 and is applied to control terminals of 65 
the multiplicand multiplexer 212, multiplier multiplexer 
222, and result select multiplexer 214 for controlling opera- 



tion of the Booth multiplier 200. The Booth logic circuit 218 
is connected to the multiplier shift register y<31:0> 208 to 
receive three bits y<3:l> and connected to the multiplier 
multiplexer 222 to receive two bits yD<l:0> concatenated 
with a logic 0 bit for initial Booth encoding, where 
yj3<31:0> is a multipher data word prior to shifting by the 
multiplier shift register y<31:0> 208. 

The Booth logic circuit 218 receives the three bits y<3:l> 
and the two bits yjr,<l:0> concatenated with a logic "0" to 
control the grouping by the Booth multiplier 200 of the 
multiplier y<31:0> into two-bit groups. The three bits 
received by the Booth logic circuit 218 include two bits of 
the multiplier signal y and a most significant bit of a lower 
order group of bits. In the first iteration of the Booth 
multiplier 200, the three control bits used to control the 
Booth logic circuit 218 includes multiplicand data bits 
y£,<l:0> concatenated with a logic "0", For subsequent 
iterations of the Booth multiplier 200, the three control bits 
for controlling the Booth logic circuit 218 are the three 
multiplier register bits y<3:l>. In the illustrative embodi- 
ment of the Booth multipher 200, an assumption is made 
that the multiplier y<31:0> is loaded immediately preceding 
the first Booth iteration. Otherwise, the first Booth iteration 
is performed using input signals including two multiplier 
register bits y<l:0> concatenated with a logic "0". 

The Booth logic circuit 218 includes a decoder (not 
shown) which decodes a two-bit input signal including 
Booth bit<l> and Booth bit<0>. The Booth bits <1:0> are 
selected from the two bits y<3:2> and the two bits y^<l:0> 
according to TABLE I, as follows: 

TABLE I 
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TABLE I illustrates the partial product operation that is 
performed for each iteration of the two-bit Booth multiply 
operation. The partial product operation is selected from 
among the partial product operations including +0, ±x and 
±2x. In one embodiment of a Booth multiplier circuit, the 
±2x operation disadvantageously extends the data path one 
extra bit. For example, in the embodiment illustrated by FIG. 
2, the ±2x operation generally increases the data path from 
32 bits to 33 bits. 

Extension of the data path is advantageously avoided by 
special handling of ±2x partial product operations. The +2x 
partial product operation is avoided by noting that the result 
of a +2x operation is an appending of a zero value bit to the 
least significant bit of the result from the adder 202 and a left 
shifting of the multiplicand x<31:0> by one bit position. 
Accordingly, the least significant bit of the accumulated 
partial product is the least significant bit of the right-shifted, 
two-bit accumulated partial product of the previous cycle so 
that one fewer bit is passed through the adder 202 for ±2x 
partial product operations without extending the data path 
204 one extra bit. 

The -2x partial product operation is avoided by noting 
that -2x is equal to 2 times -x, which is further the same as 
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2 times the quantity x+1. Accordingly, the partial product The shift right 1 signal is also apphed to the multiplier 

-2x is equal to the quantity x+1, shifted left by one bit. As multiplexer 222 to select which data bits of the result data 

a resuh, least significant bit handling is the same for the +2x R<31:0> and the temporary data z<31:0> are written to the 

partial product operation and the -2x partial product opera- multiplier shift register y<31:0> 208. Result data bits 

tion. 5 R<1:0> and temporary data bit z<0> are connected to the 

In summary, the adder 202 receives the input data input terminal of the multiplier multiplexer 222. When the 

x<31:0> from the multiplicand multiplexer 212 for the case shift right 1 signal is asserted, result data bit R<0> is written 

of the +2x partial product operation. The adder 202 receives to the most significant bit y<31>, temporary data bit z<0> is 

the input data -x<31:0> from the multiplicand multiplexer written to bit y<30>, and multiplier data bits y<31:2> are 

212 and adds 1 directly to the adder 202 for the case of the shifted right two positions and written to bits z<29:0> of 

-2x partial product operation. For the i2x partial product multiplier shift register y<31:0> 208. The shift right 1 signal 

operations, the least significant bit in the accumulated partial ^ asserted to replace the ±2x partial product operations, 

product is dropped. For the ±2x partial product operations. When the shift right 1 signal is not asserted, result data bits 

the adder 202 avoids the one-bit left shift operation that is R<1:0> are written to the most significant bits z<31:30> of 

used for the ±x partial product operations. In this manner, the ^5 the multiplier shift register y<31:0> 208 and multiplier data 

Booth multipher 200 maintains the same data width in the bits y<31:2> are shifted right two positions and written to 

adder 202 for ±2x and ±x partial product operations. bits z<29:0> of multiplier shift register y<31:0> 208. 

Advantageously, the multiplicand multiplexer 212, which The ±2x partial product operations are replaced by apply- 

generates the partial products for the Booth multiplier 200, ing ±x partial product operations to the first input terminal 

only selects ±x and +0 partial product operations using a 20 °f ^^e adder 202 and dividing the data applied to the second 

three-to-one multiplicand multiplexer 212 rather than +0, input data by two. Specifically, the shift right one bit signal, 

±2x and ±x partial product operations requiring a five-to-one when asserted, is applied as a control signal to the result 

multiplexer. select multiplexer 214 which shifts the bits of z<31:0> right 

The Booth logic circuit 218 generates control signals one position as the data word is directed to the adder 202. By 

including the select 0, select +x, select -x, and the shift right 25 shifting prior to the adder 202, the data path from the adder 

1 control signals, llie select 0, select +x, and select -x 202 to the registers Z<31:0> and y<31:0> is maintained at 

signals are apphed to the multiplicand multiplexer 212 to 32-bits in width, rather than the 33 bits which would 

select the partial product operation applied to the adder 202, generally be necessary for a 2-bit Booth multiplier, 

thereby determining whether the first input terminal of the When the sum is accumulated in the temporary shift 

adder 202 receives a zero data value, the x register value 30 register z<31:0> 210 and the multiplier shift register 

from the multiplicand register x<31:0> 206 or the negated x( y<31 :0> 208, data is shifted right only one bit when the shift 

X) register value from the multiplicand register x<31:0> 206 right one bit signal is asserted in comparison to a two-bit 

via an inverter (one's complement circuit) 224. The select shift when the shift right one bit signal is de asserted. The 

-X signal is applied to the adder 202 to selectively add 1. operation of shifting the temporary shift register z<31:0> 

Adding 1 converts the one's complement of x<31:0> into 35 210 data right one bit advantageously allows the width of the 

two's complement form. data path 204 to be maintained at 32-bits. 

The shift right 1 signal is applied to the result set To initialize the Booth multipher circuit 200, a multipli- 

multiplexer 216 to select which data bits of the result data cand x<31;0> is written to the multiplicand register x<31:0> 

R<31:0> are written to the temporary shift register z<31:0> 206. The initialization signal is also apphed as a control 

210. Result data bits R<31:1> are connected to the input 40 signal to the result set multiplexer 216 and the multiplier 

terminal of the result set multiplexer 216. When the shift multiplexer 222. The initialization signal causes the result 

right 1 signal is asserted, result data bit R<31>is written to set multiplexer 216 to initiaUze the temporary shift register 

the most significant bit z<31> of the temporary shift register z<31:0> 210 to a zero value. The initialization signal causes 

z<31:0> 210 and resuh data bits R<31:1> are shifted right the multiplier multiplexer 222 to write a multiplier y<31:0> 

and written to bits z<30:0> of the temporary shift register 45 to the multiplier shift register y<31:0> 208. The initiahza- 

z<31:0> 210. llie shift right 1 signal is asserted to replace tion has a duration of two clock cycles, 

the ±2x partial product operations. When the shift right 1 Referring to FIG, 3 a high-level schematic block diagram 

signal is not asserted, result data bit R<31> is written to the illustrates a multimedia multiprocessor system 300 which 

most significant bit z<31> and to bit z<30> of the temporary implements an illustrative technique for synchronizing 

shift register z<31:0> 210 and result data bits R<31:2> are 50 audio and video frames in an MPEG presentation system, 

shifted right two bits and written to bits z<29:0> of the The multimedia multiprocessor system 300 includes a host 

temporary shift register z<31:0> 210. The two-bit shift processor 302 and a multimedia signal processor 304. A 

operation advantageously halves the number of iterations typical host processor 302 is an x86 processor such as a 

that are used for a Booth multiply operation using a single Pentium"^" or Pentium Pro''" processor. The host processor 

adder. 55 302 executes programs based on instructions and data held 

The shift right 1 signal is applied to the result select in a system memory 316. The host processor 302 commu- 

multiplexer 214 to select which data bits of the temporary nicates with the multimedia signal processor 304 via a 

data z<31:0> from the temporary shift register z<31:0> 210 system bus 306 such as a PCI bus. The multimedia signal 

are loaded to the adder 202. Temporary data bits z<31 :0> arc processor 304 interfaces to various functional blocks such as 

connected to the input terminal of the result select muhi- 60 an audio and communication CODEC 308, a video A/D 

plexer 214, When the shift right 1 signal is asserted, tem- converter 310, a video D/A converter 312, and a frame buffer 

porary data bit z<31> is loaded to the most significant bit of SDRAM memory 314. 

the z<31:0> input to the adder 202 and temporary data bits Referring to FIG. 4, a schematic block diagram shows the 

z<31;l> are shifted right and input to bits z<30:0> of the multimedia signal processor 304 within the multimedia 

adder 202. When the shift right 1 signal is not asserted, 65 multiprocessor system 300. UTie multimedia signal processor 

temporary data bits z<31:0> are directly input to bits 304 includes a digital signal processor (DSP) core 402 

z<31:0> of the adder 202. which is connected to a plurality of multimedia interfaces. 



09/30/2004, EAST version: 1.4.1 



5,958,000 

7 8 

The DSP core 402 is the computation engine of the storage and 4 Kbyte of data storage for the vector processor 

multimedia signal processor 304 and includes a scalar 406, and a shared 16 Kbyte of integrated instruction and data 

processor 404, a vector processor 406, a cache subsystem cache ROM for both the scalar processor 404 and the vector 

408, a fast bus (FBUS) 410, and an I/O bus 412. The scalar processor 406. The cache subsystem 408 interfaces to the 

processor 404 is a scalar processor based on a 32-bit 5 scalar processor 404 through 32-bit data buses and interfaces 

ARM?™ control processor which is designed and tnanu- ,0 the vector processor 406 through 256-bit data buses. The 

factured by ARM Limited, Great Britain, and performs ^ache ROM 418 includes uROM initialization software, 

general processing functions such as real-time operatmg diagnostics software, various system management 

systern operations, interrupt and exception handling input/ ^^^^-^^^ ^ ^^^^^ ^^1^^,^^ 

output device managetnent, commumcation with the host j^,^ ^^^^^^^ Specifically, the cache ROM 418 

processor 302 and he like^In one embodiment, the scalar ^^^^^^ instruction exception handler and input and 

processor 404 opera es at 40 MHz^ lTie scalar processor 404 ^^^.^^ .^^ f,,^^,^''^ 0. 1, 2 and 3 for the scalar 

mieriaces to ine vector processor 4U(, through a coprocessor ^^^^^ 404. ^.,^he ROM 418 also includes a vector 
interlace 442 

. ' processor interrupt handler and a vector processor break- 

The multimedia signal processor 304 performs multiple ,5 ^^^^^ exception handler which execute in the scalar proces- 

various multimedia operations. One multimedia operation is ^qj. 

audio decoding so that the scalar processor 404 and the ,^r»TTc Am • . r . 1 v. n^niTo • u 

^ . 1 • . -.1. J The FBUS 410 interfaces to a plurality of FBUS periph- 

vector processor 406, in combination with program codes ^^^^ ^ ^^[^^.^ p^/^^ .^,^^j^> ^ ^ 

which operate on the processors, form an audio processor ^ g^.^j^ ^^^^ ^^J^^ ^^^^^^^^^ ^^2, an 8-channel 

^noisnown;. 20 DMA controller 424, a customer ASIC logic block 426, and 

u fn»^® °- ^' ' ^"^^T.T block diagram Illustrates 3 ^ ^ata mover 428. The PCI bus interface 420 

the ARM? scalar processor 404 which ^ controlled by an interfaces to the system bus 306 and operates, for example, 

mstruction decoder and control logic 502. The scalar pro- ^t 33 MHz. The customer ASIC logic block 426 furnishes 

cessor 404 communicates with the cache subsystem 408 via ^.^ntrol logic for implementing custom functionality, as 

a write data register 5M and an instruction pipehne and read 25 ^^^-^^^ J^^^ ^^^tomer ASIC logic block 426. in one 

dat^i register 506 llie scalar processor 404 mcludes an embodiment, supplies 10 Kgates including interfaces to 

address register 508 and an address incrementer 510 or ^^-^^ ^^^j cODECs and customer-specific I/O devices, 

addressing data m a 31x32-bit register bank 512. The scalar ^h^ ^ j^,^ ^^^^^ 428 transfers DMA data from the 

T u f togic such as a 32-bit ^^^^ processor 302 to SDRAM memory 430 which is local 

ALU 514, a barrel shifter 516 and a Booth s multiplier 518. 30 to the multimedia signal processor 304. 

The coprocessor interface 442 is coupled directly to the , . , „ , , ■■. r i/^ u 

instruction decoder and control logic 502 via nOPC, nCPI, ^ ^'.^ "^'"^^f * '° P'"^»l"y °f ^'^ 

CPAand CPB signal lines that communicate operation codes ^"'J^^"^^^' " ^'f^am processor 432 

,*«ot„.^f,-«„ 7™ u^*,„^»„ ♦u^ Af\A a UART serial line 434, a timer circuit 436, an interrupt 

and instruction arguments between the scalar processor 404 , .-^ , ' ^^,v . * 

„ . , ^n^: *u u .u • * controller 438, and a special register 440. The bit stream 

and the vector processor 406 through the coprocessor inter- t. 

face 442 r processor 432 processes the video bit stream. The special 

„ - * . . , ^rv^ ■ t register 440 is used for software-controlled initialization and 

Refernng again to FIG. 4, the vector processor 406 is the inlerrupt handling, 

digital signal processing engine of the multimedia signal . , . , ..... 

processor 304. The vector processor 406 has a Single- While the invention has been described with reference to 

Instruction Multiple-Data architecture and includes a pipe- 40 embodiments, it will be understood that these 

lined RISC engine that operates on multiple data elements in embodiments are 1 lustrative and that the scope of the 

parallel to perform signal processing functions such as invention is not limited to them. Many variations. 

Discrete Cosine Transforms (DCT), FIR filtering, modifications, additions and improvemen^^^^ 

convolution, video motion estimation and other processing "^^""'^ described are possible^For example the embodiments 

operations. The vector processor 406 supports vector arith- 45 ^^"^""^^ ^J^'^'' Booth multiplier circuit. Many 

metic in which multiple data elements are operated upon in alternative data widths may be handled in other embodi- 

parallel, in the manner of a vector process, by a plurality of '^f^*^- In addition, other embodiments may include multiple 

vector execution units. The vector processor 406 executes adders with each of the adders servicing a plurality of bits m 

both scalar operations and combined vector-scalar opera- ^.^^J^ word. Each of the individual adders may incorporate 

tions. The multiple data elements of the vector processor 406 50 ^hedescribed Booth multiplier circuit and operating method, 

are packed in a 288-bit vector which is computed at a rate ^"f claimed iS; , . .. . 

of thirty-two 8/9-bit fixed-point arithmetic operations, six- , ^' ^ ^irc^^t for performing multiplication 

teen 16-bit fixed-point arithmetic operations, or eight 32-bit ^^/^.^S*^ mampulation of a plurality of signals the plurality 

fixed-point or floating point arithmetic operations per cycle "'Snals mcludmg at least a multiplier, a multiplicand and 

(for example, 12.5 ns). Most 32-bit scalar operations are 55 ' ^""^''^ P'''''^^^' ^^"^ multiplier circuit comprising: 

pipelined at a rate of one instruction per cycle and most an adder having a first input terminal, a second input 

288-bil vector operations are pipelined at a rate of one terminal, and an output terminal; 

instruction per cycle. Load and store operations are over- a multiplicand register having an output terminal coupled 

lapped with arithmetic operations and are executed indepen- to the first input terminal of the adder; 

dently by separate load and store circuits. 60 a temporary shift register having an output terminal 

The cache subsystem 408 includes a data cache 414 (for coupled to the second input terminal of the adder and 

example, 5KB), an instruction cache 416 (for example, having an input terminal coupled to the output terminal 

2KB), and a cache ROM 418 (for example, 16KB) and of the adder; 

typically operates at the same speed as the vector processor a multiplier shift register having an input terminal coupled 

406 (80 MHz). In one embodiment, the cache subsystem 408 65 to receive least significant bits of the partial product 

includes 1 Kbyte of instruction storage and 1 Kbyte of data from the output terminal of the adder, and the input 

storage for the scalar processor 404, 1 Kbyte of instruction terminal of the multiplier shift register coupled to 
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receive the multiplier, the multiplier shift register hav- 
ing an output terminal; 

a Booth logic circuit having an input terminal coupled to 
the output terminal of the multiplier shift register and 
having an output terminal; 

a first switch coupled between the output terminal of the 
multiplicand register and the first input terminal of the 
adder and having a control terminal coupled to the 
output terminal of the Booth logic circuit, the control 
terminal selecting a signal applied to the adder from 
among a zero input signal, a plus multiplicand signal, 
and a minus multiplicand signal; and 

a second switch coupled between the temporary shift 
register output terminal and the second input terminal 
of the adder and having a control terminal coupled to 
the output terminal of the Booth logic circuit, the 
control terminal selecting a bit shift number determin- 
ing the number of bits a temporary signal is shifted for 
application to the adder, the shift number being selected 
from a 0 bit shift and a one bit right shift. 

2. A multiplier circuit according to claim 1 further com- 
prising: 

a third switch coupled between the output terminal of the 
adder and the input terminal of the temporary shift 
register and having a control terminal coupled to the 
output terminal of the Booth logic circuit, the control 
terminal selecting a bit shift number determining the 
number of bits a result signal from the adder is shifted 
for writing the result signal to the temporary shift 
register, the shift number being selected from a 1 bit 
right shift and a two bit right shift. 

3. A multiplier circuit according to claim 1 further com- 
prising: 

a fourth switch coupled between the output terminal of the 
adder and the input terminal of the multiplier shift 
register and having a control terminal coupled to the 
output terminal of the Booth logic circuit, the control 
terminal selecting an input bit designator designating 
whether the most significant bits applied to the multi- 
plier shift register include result signal bits from the 
adder or a result signal bit from the adder and a least 
significant bit of a signal in the temporary shift register. 

4. A multiplier circuit according to claim 1 further com- 
prising: 

a third input terminal of the adder; 

a fifth switch coupled to the third input terminal of the 
adder and having a control terminal coupled to the 
Booth logic circuit for selectively incrementing the 
adder by 0 or 1, 

5. A multiplier circuit according to claim 1 further com- 
prising: 

a sixth switch coupled to the input terminal of the mul- 
tiplier shift register and coupled to the input terminal of 
the temporary shift register for writing a multiplier 55 
value to the multiplier shift register and for writing a 
zero value to the temporary shift register on receipt of 
an initialize signal. 

6. A processor including a multiplier circuit according to 
claim 1. 60 

7. A multimedia system including a multiplier circuit 
according to claim 1. 

8. A multiplier circuit according to claim 1 wherein: 
the Booth logic circuit input terminal is coupled to the 

multiplier shift register to receive three bits of the 65 
multiplier including the most significant bit of a lower 
order group; and 
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the Booth logic circuit generates a plurality of control 
signals based on the two bits of the multiplier and the 
most significant bit of the lower order group, the 
control signals including: 

a select 0 signal; 

a select positive multiplicand signal; 

a select negative multiplicand signal; and 

a shift right one bit signal. 

9. A multiplier circuit for performing multiplication 
through manipulation of a plurality of signals, the plurality 
of signals including at least a multiplier, a multiplicand and 
a partial product the multiplier circuit comprising: 

an adder having a first input terminal, a second input 

terminal, and an output terminal; 
a plurality of shift registers including: 

a multiplicand register having an output terminal 
coupled to the first input terminal of the adder; 

a temporary shift register having an output terminal 
coupled to the second input terminal of the adder and 
having an input terminal coupled to the output ter- 
minal of the adder; and 

a multiplier shift register having an input terminal 
coupled to receive least significant bits of the partial 
product from the output terminal of the adder, and 
the input terminal of the multiplier shift register 
coupled to receive a shifted version the multiplier, 
the multiplier shift register having an output termi- 
nal; 

a shift logic coupled between the adder and the plurality 
of shift registers; and 

a Booth logic circuit having an input terminal coupled to 
the output terminal of the multipHer shift register and 
having an output terminal coupled to the shift logic, the 
Booth logic circuit controlling a two-bit Booth multi- 
plier operation designating multiplicand partial product 
operations including +0 and ±x partial product opera- 
tions and implementing ±2x partial product operations 
using the ±x multiplicand partial product operations 
while shifting data from the temporary shift register 
one bit right on application to the adder. 

10. A multiplier circuit according to claim 9 wherein: 
the Booth logic circuit input terminal is coupled to the 

multiplier shift register to receive two bits of the 
multiplier and to receive a most significant bit of a 
lower order group; and 
the Booth logic circuit generates a plurality of control 
signals based on the two bits of the multiplier and the 
most significant bit of the lower order group, the 
control signals including: 
a select 0 signal; 

a select positive multiplicand signal; 

a select negative multiplicand signal; and 

a shift right one bit signal. 

11. A multiplier circuit according to claim 9 wherein the 
shift logic includes: 

a first switch coupled between the output terminal of the 
adder and the input terminal of the temporary shift 
register and having a control terminal coupled to the 
output terminal of the Booth logic circuit, the control 
terminal selecting a bit shift number determining the 
number of bits a result signal from the adder is shifted 
for writing the result signal to the temporary shift 
register, the shift number being selected from a 1 bit 
right shift and a two bit right shift. 

12. A multiplier circuit according to claim 9 wherein the 
shift logic includes: 
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a second switch coupled between the output terminal of 
the adder and the input terminal of the multiplier shift 
register and having a control terminal coupled to the 
output terminal of the Booth logic circuit, the control 
terminal selecting an input bit designator designating 5 
" whether the most significant bits applied to the multi- 
plier shift register include result signal bits from the 
adder or a result signal bit from the adder and a least 
significant bit of a signal in the temporary shift register. 

13. A multiplier circuit according to claim 9 further lo 
comprising: 

a third input terminal of the adder; 

a third switch coupled to the third input terminal of the 
adder and having a control terminal coupled to the 
Booth logic circuit for selectively incrementing the 
adder by 0 or 1. 

14. A multiplier circuit according to claim 9 further 
comprising: 

a fourth switch coupled to the input terminal of the 
multiplier shift register and coupled to the output 
terminal of the temporary shift register for writing a 
multiplier value to the multiplier shift register and for 
writing a zero value to the temporary shift register on 
receipt of an initialize signal. ^5 

15. A processor including a multiplier circuit according to 
claim 9. 

16. A multimedia system including a multiplier circuit 
according to claim 9. 

17. A method of multiplying integer numbers in a pro- 
cessor having an adder, a multiplicand register, a temporary 
register, a multiplier shift register, and a Booth logic circuit, 
the method comprising: 

receiving least significant bits of a partial product in the 
multiplier shift register from the adder: 35 

receiving a shifted version of a multiplier in the multiplier 
shift register; 

testing two bits of data in the multiplier shift register and 
the most significant bit of a previous two-bit group to 
determine whether a partial product value of +0, ±x, or ^0 
±2x is designated; 

if a partial product value of +0 is designated, adding a zero 
data to a temporary data in the temporary register and 
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replacing the temporary register with the sum right 
shifted by two bits; 
if a partial product value of +x is designated, adding a 
multiplicand data in the multiplicand register to the 
temporary data in the temporary register and replacing 
the temporary register with the sum right shifted by two 
bits; 

if a partial product value of -x is designated, comple- 
menting the multiplicand data in the multiplicand 
register, incrementing the complemented multiplicand 
data by 1, and adding the complemented multiplicand 
data to the temporary data in the temporary register and 
replacing the temporary register with the sum right 
shifted by two bits; 

if a partial product value of +2x is designated, shifting the 
temporary data in the temporary register right one bit 
and adding the multiplicand data in the multiplicand 
register to the shifted temporary data in the temporary 
register and replacing the temporary register with the 
sum right shifted by one bit; 

if a partial product value of -2x is designated, shifting the 
temporary data in the temporary register right one bit, 
complementing the multiplicand data in the multipli- 
cand register, incrementing the complemented multi- 
plicand data by 1, and adding the complemented mul- 
tiplicand data to the shifted temporary data in the 
temporary register and replacing the temporary register 
with the sum right shifted by one bit; and 

shifting data in the multiplier shift register, the multipli- 
cand register and the temporary register two bits right, 
the low-order bits of the temporary register being 
moved into the high-order bits of the multiplier shift 
register and the low-order bits of the multiplier being 
discarded. 

18. A multiplier circuit in a processor for performing the 
method according to claim 17. 

19. A processor including a multiplier circuit in the 
processor for performing the method according to claim 17. 

20. A multimedia system including a multiplier circuit in 
a processor for performing the method according to claim 
17. 



09/30/2004, EAST version: 1.4.1 



